# First-person video understanding
EILEV BLIP-2 Flan-T5-XL
MIT
A vision-language model optimized for first-person videos, trained with the EILEV method to elicit in-context learning.
Image-to-Text
Transformers English

kpyu
135
1
EILEV BLIP-2 OPT-2.7B
MIT
A vision-language model optimized for first-person videos, built on BLIP-2-OPT-2.7B and trained with the EILEV method to elicit in-context learning.
Image-to-Text
Transformers English

kpyu
214
4